Planning and Scheduling in Hybrid Domains Using 
Answer Set Programming 



Sandeep Chintabathina 

University of Arkansas at Pine Bluff, Pine Bluff AR 71601, USA 
chintabathinasOuapb . edu 



Abstract. In this paper we present an Action Language-Answer Set Program- 
ming based approach to solving planning and scheduling problems in hybrid do- 
mains - domains that exhibit both discrete and continuous behavior. We use action 
language H to represent the domain and then translate the resulting theory into 
an A-Prolog program. In this way, we reduce the problem of finding solutions 
to planning and scheduling problems to computing answer sets of A-Prolog pro- 
grams. We cite a planning and scheduling example from the literature and show 
how to model it in H. We show how to translate the resulting H theory into an 
equivalent A-Prolog program. We compute the answer sets of the resulting pro- 
gram using a hybrid solver called EZCSP which loosely integrates a constraint 
solver with an answer set solver. The solver allows us reason about constraints 
over reals and compute solutions to complex planning and scheduling problems. 
Results have shown that our approach can be applied to any planning and schedul- 
ing problem in hybrid domains. 

1 Introduction 

In this paper we are interested in modeling intelligent agents capable of planning, acting 
and reasoning in a dynamic environment. We are primarily interested in hybrid domains 
- domains that exhibit both discrete and continuous behavior. Many real world situations 
such as a robot in a manufacturing plant, a decision control system of a space shuttle etc. 
deal with both discrete change as well as continuous change. For example, a decision 
control system in a space shuttle is capable of opening and closing valves of fuel tanks 
to supply fuel to some jets. The actions open and close change positions of valves. 
The valves will remain in a certain position as long as no open or close actions occurs. 
The change of position of valves is, therefore, discrete. However, the fuel level in the 
tank can change continuously with time when the valve is open or there is an incoming 
supply of fuel to the tank. This type of continuous change coupled with discrete change 
makes this domain hybrid. We are not only interested in modeUng such domains but 
also to solve planning and scheduling problems in such domains. A simple planning 
and scheduling problem in the above example would be to fire a jet within 10 seconds. 
This would require opening and closing the appropriate valves and delivering fuel to 
the jet in time to achieve the goal. 

To model intelligent agents we need to provide the agent with knowledge about 
its environment and its own capabilities and goals. There are several approaches to 
representing and reasoning about such knowledge. In this paper, we use action language 
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H - a high level language for representing and reasoning about actions and their effects. 
We chose this language mainly because it is capable of representing and reasoning 
about hybrid domains. A description written in H describes a transition diagram whose 
states correspond to possible physical states of the system and whose arcs are labeled by 
actions. A transition, (ff, a, d'), of a diagram denotes that action a is possible in state (7 
and that after the execution of a the system may move to state a'. The diagram consists 
of all possible trajectories of the system. 

Language H ||4l was developed by extending the signature of action language £/ ^ 
Q with a collection of numbers for representing time, a collection of functions of 
time, and fluents with non-boolean values including fluents defined by functions of 
time (process fluents). It also allows triggers which specifies conditions under which 
actions are triggered. The semantics of is based on the McCain-Turner equation 
iPl. A slightly modified version of this equation defines the semantics of H. Thus, both 
languages are based on the same underlying intuition. However, in s;/ Jff a state of the 
transition diagram is a snapshot of the world, whereas in H, a state is a snapshot of the 
world over a time interval. Process fluents are defined over this time interval. Prior to H, 
logic-based formalisms such as Situation Calculus lfT3l and Event Calculus lfT4l were 
extended to reason about process fluents. Both approaches demonstrate via examples 
how their approach can be used for reasoning about process fluents. However, in both 
approaches, it is difficult to express causal relations between fluents. 

To solve planning and scheduling problems in hybrid domains, we use an action 
language-logic programming approach. We begin by representing the domain in H. To 
this extent, we come up with a theory of H (also called action description) to describe 
actions and their effects. We implement this theory by translating it into an Answer Set 
Prolog (A-Prolog) program, a class of logic programs under answer set semantics II7I8II . 
and computing answer sets (models) of the resulting program. In this way, we reduce 
the problem of finding solutions to the planning and scheduling problem to computing 
answer sets of the A-Prolog program. In f5\ we show that there is a one-to-one corre- 
spondence between models of A-Prolog programs and models of our specification. 

The paper is organized as follows. In section 2, we revisit the syntax and seman- 
tics of H as described in |4|. In section 3, we cite a planning and scheduling example 
from lfT2] and come up with a theory of H to model it. In section 4, we present the 
main contribution of this paper which is the translation of the resulting H theory into 
an A-Prolog program. The example from \12] was run using ZENO, a least commit- 
ment planner that supports continuous change. After ZENO, researchers have come up 
with several efficient solvers. One significant contribution is the Planning Domain De- 
scription Language (PDDL) which was exclusively developed for planning purposes. A 
variant of this language, PDDL+ |6 | is capable of modeling continuous change through 
the use of processes and events. Like H, the semantics of this language is defined using 
a labeled transition diagram. The language has been shown to represent a number of 
complex time dependent effects. However, one major limitation of the language is that 
it does not support derived predicates - predicates defined in terms of other predicates 
for eg. above {x,y) from the blocks world domain. There are no such limitations in H. 
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2 Preliminaries 

In this section we revisit the syntax and semantics of H as described in 
2.1 Syntax 

By sort we mean a non-empty countable collection of strings in some fixed alphabet. 
A sorted signature 2^ is a collection of sorts and function symbols. A process signature 
is a sorted signature with special sorts time, action, and process. Sort time is normally 
identified with one of the standard numerical sorts with the exception that it contains 
an ordinal (O such that for any x £ time \ {ft)}, (0 > x. No operations are defined over 
ft). If time is discrete, elements of time \ {ft)} may be viewed as non-negative integers, 
otherwise they can be interpreted as either rational numbers, constructive real numbers, 
etc. 

Sort process contains strings of the form XT.f{T) where T is a variable ranging over 
time and f{T) is a mathematical expression (possibly) containing T. A string XT.f{T) 
represents a function defined over time. The A is said to bind T in f{T). If the expres- 
sion, f{T), does not contain any variables for time then Ar./(r) is said to denote a con- 
stant function. For e.g. Ar.O denotes the constant function 0. For simplicity we assume 
that all functions from process have the same range denoted by the sort range{process) . 
An example of a function from sort process is XT.h — {g/2) *{T ~ t)^ which defines the 
height in meters, at time T, of a freely falling object dropped from a height h,T — t sec- 
onds before. The symbol g denotes the Earth's average gravitational acceleration which 
is equal to 9.8 meters/sec^. 

Sort action is divided into subsorts agent and exogenous. Members of agent denote 
(unit) actions performed by an agent and members of exogenous denote non-agent ac- 
tions. Both agent and exogenous actions will be referred to as actions. A compound 
action is a set of unit actions performed at the same time. 

The collection of function symbols includes names for fluents and standard numerical 
functions. Each fluent name is associated with an arity - a number indicating the number 
of arguments. Intuitively, ^Menf.s are properties that may change as a result of actions. 
For example, the height of a brick held at a certain position above the ground could 
change when it is dropped. Every process signature contains reserved fluents start and 
end of sort time. 

A term of sort s is defined as follows: 

1. A string y e i is a term of sort s; 

2. If f 1 , . . . , f„ are terms of sorts si,.. .,s„ respectively and f : si x . . . x i„ — > i is a 
function symbol then /(fi , . . . ,f„) is a term of sort s. 

Notice that if f{x) is a term of sort process and f is a term of sort time then f{x){t) 
is a term of sort range{process). For example, to represent the height of brick b we 
can introduce a fluent height{b) of sort process. Then by height{b){\Q) we denote the 
height of b at time 10. Similarly, Ar.200 - {g/2) * {T - 5)2(10) denotes the value of 
the function at T = 10 which is equal to 77.5. 

An atom of Z is a statement of the form t where f is a term of some sort s and y £ s. 
Examples of atoms are end — 10, 2 + 3 = 5 etc. If f is a term built from fluent symbols 
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then such an atom is called a fluent atom. Examples of fluent atoms are height (b) = 
Ir.lOO- (g/2) * (r-5)2, height{b){5) = 100, end = 10, etc. 

A literal of T. is an atom f = y or its negation ^{t =y) . Negation of = will be often 
written as ^. If f is a term of Boolean sort then t = true (t ^ false) is often written as t 
and t = false (t ^ true) is often written as -^t. For example, the atom 4 < 5 = true will 
be written as 4 < 5. 

Language, H, is parameterized by a process signature L with standard interpretations 
of numerical functions and relations (such as +, <, <, 7^, etc). 

Deflnition 1. An action description of H{E) is a collection of statements of the form: 



where e's are elements of action, Iq's are fluent atoms and /i, . . . ,Z„ are literals of the 
signature of H. Iq is referred to as the head of a statement and Zi , . . . , /„ are referred to 
as the body of a statement. A statement of the form (1) is called a state constraint. It 
guarantees that any state satisfying li,...,l„ also satisfies /q. A statement of the form 
(2) is called a dynamic causal law and it states that if action e were executed in a state 
satisfying literals ly,. . . ,l„ then any successor state would satisfy Iq. A statement of the 
form (3) is called an executability condition and it states that actions ei,...,em cannot 
be executed in a state satisfying /i , . . . , Z„. If n = then is dropped from statements 
(1), (2) and (3). A statement of the form (4) is called a trigger and it states that action e 
is triggered in any state satisfying li,.. .,l„. 

By ground instantiations of a variable of sort s, we mean the elements of s. From 
the description of the syntax, the only variable that appears in the statements of H is the 
variable T ranging over time. However, variables ranging over other sorts are allowed in 
the statements as long as those statements are viewed as a shorthand for the collection 
of statements obtained by replacing each occurrence of a variables other than T by its 
corresponding ground instantiations. 

2.2 Semantics 

The semantics of language H is based on a slightly modified McCain-Turner equation 
||9l. An action description, AD, of H(Z) describes a transition diagram, TD{AD), whose 
nodes correspond to possible physical states of a system and whose arcs are labeled by 
actions. A transition {s,a,s') of the diagram denotes that action a is possible in s and as 
a result of execution of a the system may move to state s'. It is important to note that 
an action description of H can be either deterministic (i.e. for any state-action pair there 
is at most one successor state Q) or non-deterministic (i.e. there is a state-action pair 
with more than one successor state). In this section we will give a formal definition for 
a state and a transition of TD{AD). We begin with interpreting symbols of E. 

Definition 2. Given an action description AD of H{E), an interpretation / of Z is a 
mapping defined as follows. 



Iq ifli,...,l„. 
e causes Iq if h,. . . ,l„. 
impossible ei, . . . ,em if h 
;i,...,;„ triggers e. 



(1) 

(2) 



(3) 
(4) 
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- for every non-process sort, s, and every string y € s, I maps y into itself i.e. y = y. 

- standard interpretation is used for the sort process and other standard numerical 
functions and relations. 

- / maps every fluent into a properly typed function. 

Often an interpretation / of is identified with a coUection, s{I), of atoms of the form 
t = y such that t' =y where t and y are terms of some sort. In other words, s{I) = {t = 

y\t'=y}- 

Before we give the definition of a state of TD{AD) let us consider the following defini- 
tions. 

A set, s, of atoms is said to be consistent if for every atom t =yi £ s, ^2 such that 
t =y2&s and ^yi- 

Let us define what it means for a literal to be true w.r.t a set of atoms of Z. 
Definition 3. Given a consistent set, L, of atoms of E 

- An atom t = y is true in L (symbolically L \= t = y) \ii t = y € L. 

- A Uteral t ^yis true in L {L\= t ^ y) L\= t = yo and y ^ yo- 

We wiU now define what it means for a set of atoms to be closed under state constraints 
of AD. 

Definition 4. A setL of atoms is closed under the state constraint 

/o if /i , . . . , /« 

of AD if, whenever L |= /, for every i, \ <i <n,L\= Iq. 

A set L of atoms is closed under state constraints of AD if L is closed under every state 
constraint of AD. 

Next, we define what it means for a set of atoms to satisfy a trigger of AD. 
Definition 5. A setL of atoms of H satisfies a trigger 

triggers e 

of AD iff L 1= for every i such that I <i<n. 

Intuitively, if a set of atoms satisfies a trigger it means that the corresponding action 
wiU take place at some time point. The next definition characterizes sets of atoms which 
"end" before any triggered action takes place. 

Definition 6. A set L of atoms ofH is closed under triggers of AD iff ^ L' such that L' 
satisfies at least one trigger of AD and L\L' = {end = ^2} and L'\L= {end = ?i } and 

tl < t2. 

Intuitively, a state of TD{AD) can be viewed as a collection of functions of time defined 
over an interval. The endpoints of the interval are implicitly defined by the reserved 
fluents start and end. The domain of each function is the set {t \ start <t< end At < 
(o}. We say that a state is defined over an interval of the form [start, end] iff end ^ (0. 
There is at least one arc labeled by an action leading out of such a state. We say that 
a state is defined over an interval of the form [start, end) iff end = co. There is no arc 
leading out of such a state. States that begin at time are called initial states. They 
define the initial conditions of a domain. Here is the formal definition of a state. 
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Definition 7. Given an interpretation / of Z, s{I) is a state of TD{AD) if each of the 
following holds. 

- s{I) is a collection of atoms of the form t = y such that t' =y where t and y are 
terms of the same sort. 

- s{I) is closed under the state constraints of AD. 

- If s{I) \= start = ti and s{I) \= end = t2 then ti < ?2 A?i < (0. 

- s{I) is closed under the triggers of AD. 

- If s(/) \= p = XT.f{T) where pis a fluent of sort process then XT.f{T) is defined 
over the domain {t \ start' <t< end' At < co}. 

- If /5 is a fluent of sort process and f is a term of sort time then s{I) \= p{t) = x iff 
sil) ^p^XT.fiT) and XT.f{T){t')=x. 

By deflnition of interpretation every symbol is mapped uniquely. Therefore, states of 
TD(AD) are complete and consistent. Whenever convenient the parameter / will be 
dropped from s{I). Next, we wiU define what is means for an action to be possible in a 
state. 

Definition 8. Action a is possible in state s if for every non-empty subset ao of a, there 
is no executabiUty condition 

impossible ao if /i , . . . , /«. 
of AD such that s \= U for every i, \ <i<n. 

Given a state s and action e let us define what are the direct effects of executing e in s. 

Definition 9. Let e be an elementary action that is possible in state s. By Egie) we 
denote the set of all direct effects of e w.r.t s. 

Es{e) = {/o I e causes /q if /i ,...,/« e AD A s |= /,■ for every I <i<n} 

If a is a compound action then = {jeea^si^)- 

The foUowing definition aUows us to identify sets of atoms with adjacent intervals. 

Definition 10. Let x,y, and z be elements of sort time such that x <y < zAy < (0 and 

s and s' be sets of atoms of H. We say that s' follows siff s\= {start = x,end = y} and 

s' \= {start = y,end ~ z\. 

Given two sets of atoms s and s' , the function rv(.s-') is defined as follows. 

, _ / {start = t\ , end = t2} if s' follows s A {start = t\ , end = t2} C s' . 
"^■^ ^ ~ \ otherwise. 

In other words, the function returns the interval of s' if s' follows s; otherwise it returns 
an empty set. 

The consequences of a set of atoms w.r.t a set of state constraints is defined as follows. 

Definition 11. Given a set S of atoms and a set Z of state constraints of AD the set, 
Cnz{S), of consequences of S under Z is the smallest set of atoms (w.r.t set theoretic 
inclusion) containing S and closed under Z. 
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Definition 12. Action a is complete w.r.t a set of atoms s if for every trigger r e AD of 
the form 

/i,...,/„ triggers e 

e € fl iff i satisfies r. 

We know that a state contains arbitrary atoms of T. . However, for the next definition we 
focus only on fluent atoms including those formed from start and end. All other atoms 
such as 2 + 3 = 5 which is always true will be ignored. 

Definition 13. A transition diagram TD{AD) is a tuple {(p,yf) where 

- is the set of states. 

- 1// is the set of all transitions {s,a,s') such that each of the following holds. 

• fl is complete w.rt s 

• a is possible in s 

• s' is closed under the triggers of AD 
• 

/ = Cnz{E,{a) U {sHs') U T,{s')) (1) 

where Z is the set of state constraints of AD. 

The set, Es{a), consists of direct effects of a while the set, sDs', consists of facts pre- 
served by inertia. Ts{s') projects the start and end of s'. The application of Cnz to the 
union of these sets adds the indirect effects. From the definition, it is evident that con- 
current actions such as dropping two balls from different heights at the same time can 
be handled in H. 

3 A planning and scheduling example 

In this section we will visit a planning and scheduling example from lfT2l and demon- 
strate how to model it in H. Consider a toy world in which a single plane moves passen- 
gers between cities. Slow flying travels at 400 miles per hour and consumes 1 gallon of 
fuel every 3 miles, on average. Fast flying travels at 600 miles per hour and consumes 1 
gallon of fuel every 2 miles. Passengers can be boarded in 30 minutes and deplaned in 
20 minutes. Refueling gradually increases the fuel level to a maximum of 750 gallons, 
taking one hour from an empty tank. Boarding, deplaning, and refueling must all occur 
while the plane is on the ground. The distance between city-a and city-b is 600 miles, 
the distance between city-a and city-c is 1000 miles, the distance between city-b and 
city-c is 800 miles and the distance between city-c and city-d is 1000 miles. Suppose 
that Dan and Ernie are at city-c, but the empty plane and Scott are at city-a. If the plane 
only has 500 gallons of fuel, how can we ensure that Scott and Ernie get to city-d in 
less than five and a half hours? lfT2l 

A solution to this problem requires reasoning about effects of (concurrent) actions in 
the presence of continuous time. There are properties of the domain that change contin- 
uously with time such as the fuel level of the plane, the distance covered by the plane 
etc. With proper planning and scheduling, the plane will make its tight schedule with- 
out running out of fuel. Hence, the problem is a planning and scheduling problem in the 
presence of continuous change. We begin with the representation of the domain in H. 
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3.1 Representation in H 

We begin with the description of the signature. It consists of several objects - persons 
scott, ernie, and dan and locations a,b,c, and d. We will use another location called 
enroute to denote that the plane is in the air. We will use (possibly indexed) vari- 
ables P and L to denote persons and locations respectively. We are given the constants 
distance{a,b, 600), distance{a,c, 1000) and so on. We are also given the fuel consump- 
tion rates for different speeds. For example, when flying at 400 mph the mileage is 
3 miles per gallon. We can encode this information in the form of constants /c(400) 
and /c(600) which denote 3 miles per gallon and 2 miles per gallon respectively. For 
modeling purposes we will use minutes as our unit of time. 

We know that there are several durative actions in the domain, for example, board- 
ing which takes 30 minutes. We will use the approach used by Reiter |fT3l to model 
durative actions. We introduce the action start _boarding{P,L) which denotes that per- 
son P is boarding at location L and end _boarding{P,L) which denotes that person P 
has finished boarding at location L. So instead of one action we have two actions 
start -boarding{P,L) and end JboardingiP^L) which make the fluent boarding{P,L) true 
and false respectively. In our model, the endJ}oardmg(PL} action will be triggered 30 
minutes from the time start J}oarding{P,L) is executed. We will use the same approach 
for all durative actions. We introduce the action start _flying{Li,L2,S) to denote that 
the plane has started flying from location Li to location L2 with speed S. The variable S 
ranges over {400, 600}. The remaining actions in the domain are start _deplaning{P,L), 
endjdeplaning{P,L), start -refueling, end-re fueling, and end -flying{L\ ,L2) ■ 
The signature consists of the boolean fluents boarding{P,L), deplaning{P,L), onJboa- 
rd{P) and refueling. We also have non-boolean fluents location{P) and location(plane) 
which range over {a^b, c. d, enroute}. We have process fluents timeJeftJ}oard(PL} and 
timeJeftudeplane(PL) which are clock functions to count down the time left for person 
P to board and deplane at location L respectively. We have process fluent distance. 
left{Li,L2) to denote the distance that is yet to be covered to reach L2 from Li . Finally, 
the process fluent fuelJevel denotes the fuel level in the plane's tank. We proceed with 
defining the effects of each action. The effects of start -boarding{P,L) are defined using 
dynamic causal laws 

start _boarding{P,L) causes boarding{P,L). 

start _boarding{P,L) causes timeJeftJ>oard{P,L) = Ar.mflx(0,30— {T — Tq)) 

if end — Tq. 

The following executability condition says that person P cannot start boarding if he or 
she is already boarding. 

impossible start _boarding(P,L) if boarding{P,L). 

The following are causal laws involving end _boarding{P, L) . 

end Jboarding{P,L) causes -^boarding{P,L) . 
end J)oarding{P,L) causes onJyoard{P). 
impossible end -boarding{P,L) \i ^boarding{P,L) . 
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The following are causal laws involving actions start -de planing {P,L) and endjdepla 
ning(P,L). 

start jde planing {P,L) causes de planing {P,L) 
start jde planing {P,L) causes 

timeJeftjdeplane{P,L) = XT.max{0,20— (T — Tq)) 

if end = Tq. 

impossible start _deplaning{P,L) if deplaning{P,L). 
end -deplaning{P,L) causes ^de planing{P, L) . 
end -deplaning {P,L) causes -•onJboard{P). 
impossible end uie planing {P,L) if -^de planing {P,L). 

The following are direct effects of refueling. Instead of assuming that it takes 1 hour to 
fill up from an empty tank we will assume that the rate of refueling is 20 gallons per 
minute. When we start refueling, we add fuel to the existing level and the level increases 
at the rate of 20 gallons per minute. 

start -refueling causes fuelJevel = XT.max{750,X + 20* {T — Tq)) 

if end = Tq, fuel lev el {end) =X. 
start -refueling causes refueling. 

impossible start ^re fueling if refueling. 
impossible start -re fueling if location{plane) = enroute. 
end-refueling causes -^refueling. 
impossible end -refueling if location{plane) = enroute. 

impossible end-refueling if -^refueling. 

Next, we define the direct effects of start -flying{Li,L2,S). 
start -flying{L\,L2,S) causes location{plane) = enroute. 

start -flying{Li ,L2, S) causes distance Jeft{Li,L2) — XT.max{0,X -~S*{T — To)/60) 

if distance{Li,L2,X), end = TQ. 
start -flying{Li,L2,S) causes fuelJevel = XT.niax{0,X — S * {T — Tq) / (60 * fc{S))) 

if fuel -I eve I {end) ^X, end — Tq 
impossible start -flying{Li,L2,S) if location{plane) = enroute. 
impossible start -flying{L\ .Lo.S) if location{plane) ^ LI. 
impossible if arf_//>7'ng(Li,L2, 5) if boarding {P, LI), 

location{P) =Ll. 

The following are causal laws involving action end-flying{Li,L2). 

erui-flying{Li,Li) causes location{plane) =L2- 
impossible end-flying{Li ,£2) if location{plane) ^ enroute. 

The following causal laws specify conditions under which each "ending" action is trig- 
gered. 

time -le ft -board {P,L, end) = triggers end -boarding {P,L). 
time Jeft^eplane{P,L, end) = triggers end -de planing {P,L). 
fuel Jevel{end) = 750 triggers end -refueling, 
distance de ft (Li , L2 , end) = triggers end -flying (Li , L2) • 
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The following state constraint encodes common sense knowledge. It states that a person 
on board a plane is at the same location as the plane. 

location{P) = L if location{plane) = L,onJ?oard{P). 

The following executability condition states that flying to a location without enough 
fuel is not allowed. A rational reasoner who wants to reach his or her destination safely 
will not violate this constraint. 

impossible start _flying{Li,L2,S) if fuelJevel{end) —X, 

distance{Li,L2,Y), 
X<Y/fc{S). 

This concludes the representation of the domain in H. 



4 Translation into logic program 

We solve the planning and scheduling problem by translating the above theory into an 
Answer Set Prolog program and computing answer sets of the resulting program. Thus, 
we reduce the problem of finding solutions to the planning and scheduling problem to 
computing answer sets of the resulting program. The translation that was provided in 
[5 1 is a theoretical translation from statements of H into rules of A-Prolog. However, in 
this paper we are going to provide a translation that adheres to the syntax of a specific 
ASP solver Since we are dealing with continuous functions we chose a solver that, in 
addition to computing answer sets, is capable of reasoning about constraints over reals. 
Here is a brief summary of solvers that are available at our disposal. 

In the past few years researchers lUlOl have focused on integrating Answer Set 
Programming(ASP) and Constraint logic programming(CLP). They came up with new 
systems that achieve significant improvement in performance over existing ASP solvers. 
The following such systems are available: ACsolver\ 10| (and it's successor Luna ffTl ): 
EZCSP^ and C//n§co?|^ Each solver couples a constraint solver with an answer set 
solver. However, the coupling varies. In Luna (ACsolver) and Clingcon the coupling 
is tight whereas in EZCSP the coupling is loose. We considered using Luna for our 
purposes but we found out that there are some implementation issues. Besides, the 
underlying answer set solver is less efficient than the one used by EZCSP. We also 
considered using Clingcon, however, the underlying constraint solver only deals with 
finite domains which is a limitation when dealing with continuous functions. We chose 
EZCSP as it allows us to reason about constraints over reals. 

In EZCSP, the A-Prolog programs are written in such a way that their answer sets 
encode a constraint satisfaction problem. The system calls the answer set solver, uses 
the resulting answer sets to pose a constraint satisfaction problem in the input language 
of the constraint solver, then calls the constraint solver and combines the solutions re- 
turned by both solvers. The constraint solver can use the results from the answer set 



^ http://marcy.cjb.net/ezcsp/index.html 
^ http://www.cs.uni-potsdam.de/clingcon/ 
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solver to solve new constraints but the answer set solver cannot use results from the 
constraint solver to make new inferences. The current version uses gringo+clasf[^by 
default as the ASP solver and SICSTUS Prolog as the constraint solver Both solvers 
are very fast. 

The translation that we present here will be in the input language of EZCSP. Before 
we begin the translation there are some modeling decisions to be made. These decisions 
are necessary to overcome some of the implementation issues. One of the main issues 
is the implementation of functions. The input language of EZCSP will allow us to rep- 
resent constants and variables but it is not possible to represent functions of time. For 
example, we can translate statements such as velocity — 5 directly into the input lan- 
guage of EZCSP but it is not possible to translate a statement such as height = XT.T^ . 
The reason is that there is no representation for functions in this language. The alterna- 
tive is to provide a value for T and obtain the value of the function for that instance. So it 
is possible to translate the statement height (5) ^ XT.T^{5) —25. The constraint solver 
may obtain the value for T from other constraints and then use that value to evaluate 
height (5). 

Sorts, actions, boolean fluents and fluents ranging over finite domains will be trans- 
lated as atoms in the input language of EZCSP. All other fluents ranging over infinite 
or large domains will be represented as constraint variables. Every constraint involv- 
ing a constraint variable will appear in an atom called required. For example, we write 
required{height > 5) to denote the variable height must be greater than 5. Process flu- 
ents will be represented by constraint variables. For each process fluent we can use two 
constraint variables - one to denote the initial value and one to denote the final value in 
a given state. 

Let us look at some statements from our example and see how they are translated 
into EZCSP rules. Consider the following statement from our example. 

start -refueling causes fuellevel — XT .maxil 5Q ,X + 20 * (T — Tq)) 
if end — Tq, fuellevel [end) =X. 

Since we cannot encode functions in EZCSP we are going to expand the signature to 
include a boolean fluent called refueling, a real-valued fluent called fJime to denote 
the time at which fueling began and two new real-valued fluents - fJnitial and f^final 
to denote the initial and final fuel level in a state. We then replace the above causal law 
with the following causal laws. 

start _re fueling causes fJnitial —X if f_final — X. 
start _re fueling causes fJime = T if end ~ T. 
start _re fueling causes refueling. 

f -final = max{15Q,X + 20* (T — 7b)) if fJnitial — X,fJime = 7b, 

end = T, refueling. 

The first two dynamic causal laws say that executing start _re fueling initializes fJnitial 
to the current fuel level and fJime to the end time of the current state. The next dy- 
namic law says that start ^re fueling begins the refueling process. Next, we have a state 



^ http://potassco.sourceforge.net/ 
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constraint which defines f -final in terms of fJnitial, fJime and end of a state in 
which refueling is true. We consider fluents fJnitial, fJime and refueling to be in- 
ertial. These fluents will retain their values until an action causes them to change. For 
example, fJnitial wiU be reset by actions end .refueling, start -flying{Li,L2,S) and 
end -f lying {L\,L2) because each action has a bearing on the fuel level. So we will add 
dynamic causal laws encoding the effects of these actions on fJnitial. The relationship 
between the process fluent fuelJevel in the original action description and f -final in 
the modified action description is that for any given state, f_final = fuelJevel{end). 

To translate the above statements into EZCSP we have to first define the various 
constraint variables. All the real-valued fluents will be translated as constraint variables. 
Since fluents are dependent on state, we have to consider state as one of its parameters 
during translation. It is important to note that with the assignment of time intervals 
to states, each state in a transition diagram is unique. Our approach is to assign (non- 
negative) integers to each state in the order it appears in the trajectory of a transition 
diagram. This is done in the language of EZCSP as foUows. 

#const n = 10. 

step{0..n). #domain step{I;Il). 

Here n denotes the length of the trajectory and the Momain declaration says that vari- 
ables / and II range over [0,n] . Next, we will define the various constraint variables 
parameterized by step. 

cspvar{fjime{l) , 0,400) . cspvar{fJnitial{I) , 0, 750). 
cspvarlf Jinal (/) , 0, 750) . cspvar{end (/) , 0, 400) . 

The numbers in each declaration specify the range of that variable. Next, the boolean 
fluent re fueling will be translated into the atom v{X,re fueling,!) where X ranges over 
{true, false}. It says that X is the value of refueling in step /. The action start -refueling 
is translated as occurs{start ^re fueling,!) which says that start ^re fueling occurred in 
step /. Finally, here are the EZCSP rules obtained by translating the above causal laws. 

required{f Jnitial{!\) == f _final(I)) : — occurs{st art _re fueling, !),!\ =/+ 1. 
required{f Jime{!\) == end{!)) : — occurs{start -re fueling, !),!\ =I+\. 
v{true, refueling, ! +1) : —occurs{start_re fueling,!). 

required{f -final{!) == niax{7 50, fJnitial{!)+ 20* {end {!) — f J ime {!)))) : — 

v{true, refueling, !) . 

The rest of the dynamic causal laws are translated using a similar approach. As men- 
tioned before, in EZCSP the answer set solver does not receive any input from the con- 
straint solver to make new inferences. This poses a problem especially when translating 
triggers. A trigger mentions the conditions under which an action will be executed. It 
is possible that these conditions are constraints over time. Even if these constrains are 
evaluated, since there is no feedback to the answer set solver, it is not possible to know 
when an action is triggered. The author of EZCSP, Marcello Balduccini, has suggested 
a solution to overcome this problem. Suppose that we have the following trigger. 



fuel Jevel{end) = 750 triggers endure fueling. 
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Since we have replaced process fluent fuelJevel with other real valued fluents, this 
trigger is reaUy 

/-initial =X,fJime = T, re fueling, end = (750 — Z)/20 + r triggers end -re fueling 

Given the initial fuel level X and the refuel rate of 20 gallons per minute, the expression 
(750 — X) /20 gives the number of minutes it takes to fill up the tank. The trigger states 
that if a state ends at a time when the tank is full then end-refueling is triggered. Of 
course, refueling must be also be true in that state. A direct translation of this trigger is 

occurs{end-re fueling, I) : — required {end (I) == {750 — f-initial{I))/20 + fJime{I)), 

v{true, refueling ,1). 

This rule will be fired if the body is satisfied. The atom v{true, re fueling, I) is a direct 
consequence of start-refueling. However, the constraint atom in the body of the rule 
is not obtained from other rules of the program. Balduccini suggested that it can be 
generated by adding the rules 

l{p{I),c/{I)}l : —v{t rue, re fueling,!). 

required{end{I) == {750 — f -initial {I))/ 20 + f Jime{I)) : —p{I)- 
required{end{I) < {750- fJnitial{I))/20 + fJime{I)) : -q{I). 

The choice rule will allow us to generate one of the two constraints. If no action takes 
place before the tank fills up then end-refueling is triggered. We use the same approach 
to translate other triggers in the domain. Currently, there is no formal proof stating that 
this solution will always work. However, the solution is based on answer set program- 
ming methodology and EZCSP uses a state-of-the-art answer set solver. We rely on the 
solver to give us the correct solutions. 

In our example, the goal is to have Scott and Ernie in city d in less than five and a 
half hours (330 minutes). We encode this goal in EZCSP as follows 

goal{I) : —v{d,location{scott),I),v{d,location{ernie),I), 
required{start{I) < 330). 

where start {I) is a constraint variable denoting the start time of a step. The constraint 
says that the goal state must be achieved in less than five and a half hours. It is generated 
by adding the rules 

l{^(7),n^(/)}l. 

re quired {start {I) < 330) : —g{I). 
required{start{I) >= 330) : —ng{I). 

We add the following rules to say that failure is not an option. 

success : —goal {I). 
: —not success. 

Next, we will talk about the planning component of the program. As in the case of non- 
continuous domains, we will use Answer Set Programming techniques for generating 
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and testing plans. A plan is a sequence of actions. We will be generating all possible 
sequences of actions that will lead us to our goal. In the process of generating these 
plans we use the constraints in our program to test these plans and discard the ones that 
violate the constraints. A simple way to generate plans is to use the choice rule 



where n is the length of the plan. Note that we are going to generate only the initiating 
actions for example, start_re fueling etc. This is because all the terminating actions have 
triggers associated with them and there is no need to generate them again. Since we are 
deaUng with continuous time, in addition to computing the sequences of actions, we 
will also determine when these actions will take place. This is the reason why planning 
in hybrid domains also involves scheduling. According to the problem specification, we 
have a constraint that specifies the time within which the goal has to be achieved i.e. 
five and a half hours. We can specify this constraint in EZCSP as follows. 



The rules states that all action occurrences must take place before five and a half hours. 
This constraint gives a very broad range of scheduling possibilities. The times during 
which actions will take place could now range over intervals of time. For example, in 
order to achieve our goal it may be required that Scott has to board the plane within 
the first 50 minutes and that the plane has to depart city c no later than three hours 
and 45 minutes into the trip. Dealing with time intervals is an issue because it does not 
allow us to compute values of fluents at specific time points. More issues arise when 
fluents are defined in terms of other fluents whose values are unknown. To overcome 
these issues we decided to assign specific times to the initiating actions. The times for 
the terminating actions were determined by our triggers. Here is a solution given by 
EZCSP to our planning and scheduling problem. 



It is necessary to refuel in city c because there is not enough fuel to travel to city d. 
A shorter plan can be obtained by allowing concurrent actions. For example, refueling 
and boarding can start at the same time. The running times and the performance of the 
solver will be discussed in a longer version of the paper. 



{occurs{A,I) : action{A)} : —step{I),I < n. 



required{end(I) < 330) : —occurs{A,I),action{A). 



occurs{start J}oarding(scott ,a) , 0) end{0) 
occurs{endJ>oarding{scott,a), 1) end{l) 
occurs{start-flymg{a, c,400) , 2) end{2) 
occurs{end_flying{a,c) , 3) end{3) 
occur s{st art -re fueling, 4) end{4) 
occurs{startjboarding{ernie, c) , 5) end{5) 
occurs{end_re fueling, 6) end{6) 
occurs{endJ>oarding{ernie, c) , 7) end{l) 
occurs{start-flying{c,d, 600), S) end{S) 
occurs{end-flying{c, d) , 9) end{9) 
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5 Conclusions and Future Work 

In this paper, we presented an Action Language-Answer Set Programming based ap- 
proach to solving planning and scheduling problems in hybrid domains. We used action 
language H to model a planning and scheduling example and translated the resulting 
theory into an A-Prolog program. We used a hybrid solver called EZCSP to compute 
the answer sets of the resulting program. Our approach overcomes the limitations of 
existing formaUsms such as PDDL+. We believe that our approach can be applied to 
any planning and scheduling problem in hybrid domains. 

In the future, we would like to model several benchmark examples and compare the 
performance of EZCSP with existing planners. Some of the planners used for industry- 
sized problems are domain-specific HI. It will be useful to investigate why some of 
these planners work really well. There are areas for improvement including the effi- 
ciency of solvers, the expressiveness of the input language and so on. 
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